Consed: a graphical tool for sequence finishing.
Authors
Abstract
Sequencing of large clones or small genomes is generally done by the shotgun approach (Anderson et al. 1982). This has two phases: (1) a shotgun phase in which a number of reads are generated from random subclones and assembled into contigs, followed by (2) a directed, or finishing phase in which the assembly is inspected for correctness and for various kinds of data anomalies (such as contaminant reads, unremoved vector sequence, and chimeric or deleted reads), additional data are collected to close gaps and resolve low quality regions, and editing is performed to correct assembly or base-calling errors. Finishing is currently a bottleneck in large-scale sequencing efforts, and throughput gains will depend both on reducing the need for human intervention and making it as efficient as possible. We have developed a finishing tool, consed, which attempts to implement these principles. A distinguishing feature relative to other programs is the use of error probabilities from our programs phred and phrap as an objective criterion to guide the entire finishing process. More information is available at http://www.genome.washington.edu/consed/consed.html.
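To make the idea of error probabilities as a finishing criterion concrete, here is a minimal Python sketch. It relies only on the standard phred quality definition, Q = -10 log10(P), and flags runs of bases whose error probability exceeds a cutoff. The function names, the 0.01 cutoff, and the sample quality values are assumptions chosen for illustration; this is not consed's actual algorithm or interface.

```python
def phred_to_error_prob(q):
    """Convert a phred quality value Q to an error probability P = 10^(-Q/10)."""
    return 10 ** (-q / 10)


def low_quality_regions(quals, max_error=0.01):
    """Return half-open (start, end) index ranges where the per-base
    error probability exceeds max_error.

    max_error is a hypothetical cutoff for this sketch (0.01 corresponds to Q20).
    """
    regions = []
    start = None
    for i, q in enumerate(quals):
        bad = phred_to_error_prob(q) > max_error
        if bad and start is None:
            start = i                      # a low-quality run begins here
        elif not bad and start is not None:
            regions.append((start, i))     # the run ended just before i
            start = None
    if start is not None:                  # run extends to the end of the sequence
        regions.append((start, len(quals)))
    return regions


if __name__ == "__main__":
    # Made-up consensus quality values, for illustration only.
    consensus_quals = [40, 40, 10, 15, 40, 5]
    print(low_quality_regions(consensus_quals))
```

In a finishing workflow, ranges flagged this way would be candidates for additional reads or manual inspection, while regions below the cutoff could be skipped.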
Similar resources
ReDiT: Repeat Discrepancy Tagger-a shotgun assembly finishing aid
Finishing, i.e. gap closure and editing, is the most time-consuming part of genome sequencing. Repeated sequences together with sequencing errors complicate the assembly and often result in misassemblies that are difficult to correct. Repeat Discrepancy Tagger (ReDiT) is a tool designed to aid in the finishing step. This software processes assembly results produced by any fragment as...
Finishing Drosophila mojavensis Fosmid Clone DMAC-1a
Bio 4342 students are currently working on finishing the Drosophila grimshawi dot chromosome and a euchromatic region of Drosophila mojavensis. Project DMAC-1a (from D. mojavensis) began in two contigs with only a single macroscale problem and a few microscale problems. Using Consed and phredPhrap, this project has been finished to completion and is now in a single contig. This paper presents t...
Finishing Drosophila virilis Fosmid Clone 4N16
The overarching goal of Bio 4342/W research over the past several years has been to understand a euchromatic region of the Drosophila virilis genome well enough to be able to distinguish this domain at the DNA level from the heterochromatic counterparts in its genetic relatives, such as Drosophila melanogaster, the common fruit fly. The class will utilize the well-established genome of D. melan...
Genome Sequencing and Bioinformatics Analyses of Higher Plants Chloroplasts
Chloroplast DNA in higher plants exists as closed circular molecules of about 150 kb (±30), usually presenting inverted repeat sequences separating two single copy regions [1]. The complete chloroplast genomes of around 13 higher plant species are available in the gene bank. Our group has completely sequenced the sugarcane chloroplast DNA, which is 141182 nucleotides in size. We have...
BACCardI-a tool for the validation of genomic assemblies, assisting genome finishing and intergenome comparison
SUMMARY We provide the graphical tool BACCardI for the construction of virtual clone maps from standard assembler output files or BLAST based sequence comparisons. This new tool has been applied to numerous genome projects to solve various problems including (a) validation of whole genome shotgun assemblies, (b) support for contig ordering in the finishing phase of a genome project, and (c) int...
Journal title:
- Genome research
Volume 8, Issue 3
Pages: -
Publication date: 1998